A Flexible Example Annotation Schema: Translation Corresponding Tree Representation

Authors

  • Fai Wong
  • Dong-Cheng Hu
  • Yu-Hang Mao
  • Ming-Chui Dong
Abstract

This paper presents work on constructing an example base from a given bilingual corpus using the Translation Corresponding Tree (TCT) annotation schema. Each TCT describes a translation example (a pair of bilingual sentences). It represents the syntactic structure of the source-language sentence and, more importantly, provides a facility for specifying the correspondences between the strings (both the source and target sentences) and the representation tree. Furthermore, syntax transformation clues are encapsulated at each node of the TCT representation to capture the differences in grammatical structure between the source and target languages. With this annotation schema, translation examples are effectively represented and organized in the bilingual knowledge database needed for the Portuguese-to-Chinese machine translation system.
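As a rough illustration of the kind of structure the abstract describes, the sketch below models a tree node that carries a syntactic label, the token interval it covers in the source sentence, the corresponding interval in the target sentence, and a simple reordering hint standing in for a "syntax transformation clue". The class name `TCTNode`, its fields, the helper functions, and the Portuguese/Chinese phrase are all assumptions made for this example; the paper's actual encoding is not given in the abstract and may differ.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# Half-open token interval [start, end) into a tokenized sentence (assumed encoding).
Span = Tuple[int, int]


@dataclass
class TCTNode:
    """Hypothetical TCT-style node; field names are illustrative, not from the paper."""
    label: str                                # syntactic category of the source constituent
    src_span: Span                            # tokens covered in the source sentence
    tgt_span: Optional[Span] = None           # corresponding target tokens, if any
    target_order: Optional[List[int]] = None  # assumed "transformation clue": child indices in target order
    children: List["TCTNode"] = field(default_factory=list)


def span_text(tokens: List[str], span: Optional[Span]) -> str:
    """Return the substring a span points at, or an empty string if there is no span."""
    if span is None:
        return ""
    start, end = span
    return " ".join(tokens[start:end])


def children_in_target_order(node: TCTNode) -> List[TCTNode]:
    """Apply the node's reordering hint; keep the source order when no hint is present."""
    if node.target_order is None:
        return list(node.children)
    return [node.children[i] for i in node.target_order]


# Invented example pair: Portuguese "o livro vermelho" and Chinese "红色 的 书" ("the red book").
src = ["o", "livro", "vermelho"]
tgt = ["红色", "的", "书"]

det = TCTNode("DET", (0, 1))            # the article has no target counterpart here
noun = TCTNode("N", (1, 2), (2, 3))     # livro <-> 书
adj = TCTNode("ADJ", (2, 3), (0, 2))    # vermelho <-> 红色 的
np_node = TCTNode("NP", (0, 3), (0, 3), target_order=[2, 1], children=[det, noun, adj])

# Reading the children in target order yields the target-side word order.
target_phrase = " ".join(span_text(tgt, c.tgt_span) for c in children_in_target_order(np_node))
```

In this toy case the noun precedes the adjective in the source but follows it in the target, so the hint on the `NP` node records the swap while the spans keep each node tied to its substring in both sentences.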

Similar resources

Example-Based Machine Translation Based on the Synchronous SSTC Annotation Schema

In this paper, we describe an Example-Based Machine Translation (EBMT) system for English-Malay translation. Our approach is an example-based approach which relies solely on example translations kept in a Bilingual Knowledge Bank (BKB). In our approach, a flexible annotation schema called Structured String-Tree Correspondence (SSTC) is used to annotate both the source and target sentences of a tr...

The Parsing Algorithm of Translation Corresponding Tree (TCT) Grammar

In machine translation (MT), parsing is a kernel step that analyzes an input sentence and acquires its syntactic information in order to reproduce the corresponding translation in the target language according to the syntactic relationships between the source and target sentences. The parsing process is guided by a set of language formalisms, and the design of such an algorithm is highly dep...

A Flexible Example-Based Parser Based on the SSTC

In this paper we sketch an approach for Natural Language parsing. Our approach is an example-based approach, which relies mainly on examples that have already been parsed to their representation structure, and on the knowledge that we can get from these examples the required information to parse a new input sentence. In our approach, examples are annotated with the Structured String Tree Correspo...

Application of Translation Corresponding Tree (TCT) Annotation Schema for Chinese to Portuguese Machine Translation

In Example-Based Machine Translation (EBMT) research, there are three main approaches: surface-based, pattern-based and structure-based. A structure-based EBMT system, such as the SSTC approach [1], has the problem that it relies on two syntax parsers to analyze the translation examples, but robust syntax parsers are not always available. On the other hand, Chinese and Portuguese belong ...

Publication date: 2004